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Proteomics was used to identify a protein encoded by ORF 3a in a SARS- 
associated coronavirus (SARS-CoV). Immuno-blotting revealed that inter- 
chain disulfide bonds might be formed between this protein and the 
spike protein. ELISA indicated that sera from SARS patients have signifi- 
cant positive reactions with synthesized peptides derived from the 3a pro- 
tein. These results are concordant with that of a spike protein-derived 
peptide. A tendency exists for co-mutation between the 3a protein and 
the spike protein of SARS-CoV isolates, suggesting that the function of 
the 3a protein correlates with the spike protein. Taken together, the 3a pro- 
tein might be tightly correlated to the spike protein in the SARS-CoV func- 
tions. The 3a protein may serve as a new clinical marker or drug target for 
SARS treatment. 
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Introduction 


Coronaviruses are a large and diverse family of 
enveloped, positive-stranded RNA viruses. Mem- 
bers of the coronavirus family can be divided into 
three groups according to serologic properties.’ 
Although an individual coronavirus naturally 
infects distinct animal hosts and causes particular 
diseases, the whole-genome sequencing of several 
coronaviruses such as avian infectious bronchitis 
virus (IBV), murine hepatitis virus (MHV), human 
coronavirus 229E (HCoV-229E), and porcine trans- 
missible gastroenteritis virus (TGEV), revealed 
similar genomic structure among these different 
coronaviruses.*~° 

A novel coronavirus has recently been identified, 
which causes the latest worldwide outbreak of 
severe acute respiratory syndrome (SARS). The 
analysis of the complete nucleotide sequence of 
SARS-associated coronavirus (SARS-CoV) shows 
that its genomic organization is similar to that of 
other known coronaviruses.°” The genome of 
SARS-CoV is approximately 30 kb in size and has 
14 open reading frames (ORFS).°* 

It looks easy to analyze the entire genome of 
coronaviruses, but their protein component identi- 
fication has proven to be a difficult task. So far, 
only five types of structural proteins have been 
identified as members of the coronavirus family.’ 
The spike (S) glycoprotein, together with small 
envelope (E) protein and membrane (M) glyco- 
protein, consist of the viral envelope, while the 
nucleocapsid (N) protein interacts with genomic 
RNA of the virus to form the viral nucleocapsid.'~° 
In addition, a hemagglutinin-esterase (HE) glyco- 
protein has been found in the virion surface of 
some coronaviruses such as turkey coronavirus.’ 
The analysis of mRNAs of SARS-CoV revealed 
many transcription units covering unknown ORFs 
of the genome.®*'® Although some researchers 
speculated that these unknown ORFs encode non- 
structural proteins,’ there is no evidence to support 
this assumption. 

Analysis of the genomic organization of SARS- 
CoV showed that a gene locus containing ORF3a 
and 3b located between the S and E genes,°-® is 
frequently found in different members of the 
coronavirus family (see Figure 4A). It has been 
hypothesized that ORF 3/3a and ORF3-1/3b 
may be related to viral virulence and 
pathogenesis,"'~'* whereas some suggested that 
ORF3a of TGEV is not essential for enteric 
virulence.'* However, these previous studies are 
not conclusive, since all data are based on the 
analysis of genomic sequences or genes. Here, 
we detected with proteomics a protein derived 
from ORF3a of the SARS-CoV genome. Immuno- 
blotting revealed interchain disulfide linkages 
between the spike protein and the 3a protein. 
Mutation analysis of SARS-CoV isolates and 
ELISA of SARS patients’ sera indicated that the 
ORF3a product might function together with the 
S protein in vivo. 


Results 


Proteomic identification of ORF3a product 
in SARS-CoV 


The “shot-gun” strategy using two-dimensional 
liquid chromatography electrospray tandem mass 
spectrometry (2D-LC-MS/MS) is a newly devel- 
oped proteomic approach in which a protein mix- 
ture of adenovirus can be directly analyzed with 
mass spectrometry without separation by two- 
dimensional gel electrophoresis.’ In the present 
study, the cytosol of Vero E6 cells infected with 
SARS-CoV was collected and subjected to 2D-LC- 
MS/MS. The tryptic-digested peptides of 5S, M, N, 
E and other known proteins of SARS-CoV were 
identified (not shown). In addition, two novel pep- 
tides were detected, corresponding to the residues 
180-193 and 182-193 of a predicted ORF3a protein, 
respectively (Table 1; Supplementary Material). 

According to the genomic sequence information 
for SARS-CoV, ORF 3a encoded a putative 31 kDa 
protein containing 274 amino acid residues.®” In 
order to confirm the result by shot-gun assay, the 
collected cytosol of SARS-CoV-infected Vero E6 
cells was separated with SDS-PAGE, and then the 
gel slice around the 31 kDa region was excised 
and analyzed by capillary liquid chromatography 
electrospray tandem mass spectrometry (wLC-ESI- 
MS/MS). The MS results showed four peptides of 
this ORF 3a protein including one N-terminal pep- 
tide from position 7-19, one peptide from 180- 
193 and two identical C-terminal peptides from 
236-274, encompassing 24.08% of the whole 3a 
protein sequence (Table 1; Supplementary Material). 
The results from two complementary proteomic 
studies indicate the existence of a 31 kDa protein 
encoded by ORF 3a of SARS-CoV. 


The 3a protein sequence characterization 


The predicted amino acid sequence of the 3a 
protein of SARS-CoV isolate BJ-01 is shown in 
Figure 1}. This information combined with 
DNASTAR analysis (DNASTAR Inc. USA) pre- 
dicted the existence of three transmembrane 
regions of the 3a protein, which is in agreement 
with that of the Tor2 isolate.’ The membrane top- 
ology of this protein might belong to type 3b: its 
N terminus (about 40 amino acid residues) is on 
the outer surface of the viral particle, which 
contains a putative signal peptide from residues 1 
to 15 predicted by SignalP.'”’* Its C-terminal 
domain of 160 amino acid residues is located in 
the interior of the viral particle, of which there is a 
potential cytoplasmic region of Ca ATPase (GI: 
737940) from residues E209 to D264 (also see 
Marra et al.’). 


+ TMPRED* as well as ALOM available at http:// 
psort.nibb.ac.jp/; TMHMM available at http://www. 
cbs.dtu.dk/services/TMHMM/ 
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We noticed that a cysteine-rich region overlaps 
the third membrane-spanning domain and the 
cytoplasmic tail of the 3a protein (Figure 1). More- 
over, three cysteine residues within this cysteine- 
rich region are arranged every three residues 
(CWLCWKC from residues (C127 to C133, 
Figure 1). If these cysteine residues exist in one 
helix, they would be located in the same surface, 
suggesting that the 3a protein containing these 
cysteine residues is able to form interchain disul- 
fide linkages with other viral structural proteins. 
Interestingly, comparison with S protein sequences 
of the coronavirus family also shows a cysteine- 
rich region overlapping the junction of the 
membrane-spanning domain and the cytoplasmic 
tail, which is a conserved motif of spike proteins 
in all analyzed coronaviruses (see Figure 3 of 
Marra et al.’). This feature is not seen in other 
known structural proteins of SARS-CoV. Therefore, 
we hypothesize that the protein encoded by 
ORF3a forms the interchain disulfide bonds with S 
protein of SARS-CoV though their “cysteine-rich” 
regions. 


Formation of interchain disulfide linkages 
between the 3a protein and spike glycoprotein 


In order to address whether the 3a protein forms 
interchain disulfide bonds with S protein, we gen- 
erated a polyclonal antibody against a peptide 
(3a1) derived from the cytoplasmic region of the 
3a protein of SARS-CoV (see Materials and 
Methods). Western blotting detected 3a proteins in 
the cytosol of infected Vero E6 cells (Figure 2A, 
lane 2), which agrees with the results of 2D-LC- 
MS/MS analysis. Moreover, the 3a protein was 
also detected in the crude virions of SARS-CoV 
(Figure 2A, lane 3), suggesting that the 3a protein 
might be a_ structural protein of SARS-CoV, 
although further validation is required employing 
more purified SARS-CoV virions. 

If according to sequence analysis, the 3a protein 
forms interchain disulfide linkages with S protein, 
a protein complex would be formed of around 
210 kDa in SARS-CoV virions. To test this hypo- 
thesis, the crude virions were analyzed by 


immuno-blotting under both reducing and non- 
reducing conditions. A polyclonal antibody against 
a peptide (S2) derived from the sequence of S pro- 
tein detected two bands around 210kDa and 
180 kDa under non-reducing conditions (Figure 
2B, lane 2 of left panel), whereas only one band 
corresponding to 180kDa was detected under 
reducing conditions (Figure 2B, lane 1 of left 
panel). Furthermore, the immuno-blotting with 
the anti-3al antibody also showed a band at 
210kDa under non-reducing conditions (Figure 
2B, lane 2 of right panel). In order to further con- 
firm the observations, the antibodies were stripped 
out from these blotted PVDF membranes, and then 
the membranes were re-probed by the exchange of 
the antibodies (S2 for 3a1 and 3al for 52, respect- 
ively). The same patterns were detected again (not 
shown). These results suggest that the 3a protein 
binds to the spike protein with the interchain disul- 
fide bonds on the interior side of the viral envelope 
of SARS-CoV. Since two bands corresponding to S 
protein co-existed in the gel under the non-redu- 
cing conditions (Figure 2B, lane 2 of left panel) 
and no band was detected by the anti-3a1 antibody 
at 31 kDa in the gel under the non-reducing con- 
ditions (not shown), it suggests that all molecules 
of the 3a protein are used for the formation of 
interchain disulfide linkages with the S proteins of 
SARS-CoV. 


Functional correlation of the 3a protein with 
spike glycoprotein 


If the 3a protein binds to S protein with disulfide 
bonds, it is possible that the 3a protein-specific 
antibodies may be detected in the sera of SARS 
patients. To address this question, two synthetic 
peptides, 3a1 and 3a2 corresponding to the cyto- 
plasmic region and the extracellular region of the 
3a protein, respectively, were used as antigens to 
probe for 3a protein-specific antibodies in the sera 
from clinically diagnosed SARS patients. ELISA 
results showed that the SARS patients could be 
divided into two groups corresponding to high 
responders and low responders based on the A 
values, whereas the control produced basal low 


Table 1. Identified peptides of the 3a protein of SARS-CoV with ESI-MS/MS 


Peptide sequence Calculated Observed Observed Delta Peptide 
Position [M+ H]* [M + H]* charge Xcorr CN (Hits) 
LKEDYQIGGYSEDR?* 180-193 1673.76 1673.71 3 4.20 0.78 27/52 
EDYQIGGYSEDR* 182-193 1432.43 1432.12 2 2.44 0.64 13/22 
FFTLGSITAQPVK 7-19 1409.66 1409.86 2 3.30 0.90 18/24 
LKEDYQIGGYSEDR 180-193 1673.76 1674.28 2 4.93 0.84 22/26 
LV KDPPN V QIHTIDGSSGV ANP 236-274 4092.54 4092.52 3 4.20 0.84 26/152 
AMDPIYDEPTTTTSVPL 
LV KDPPNV QIHTIDGSSG V ANP 236-274 4108.53 4108.00 3 3.86 0.95 26/152 
AMP?DPIYDEPTTTTSV PL 


The MS/MS spectra of the peptides are given in the Supplementary Material. 


* Indication of peptides identified with the shotgun strategy. 
» Indication of oxidised Met. 
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MDLFMRFFTL GSITAQPVKI DNASPASTVH ATATIPLQAS LPFGWLVIGV AFLAVFOSA T KILALNKRWQ 70 
. 


LALYKGFQFI CNLLLLFVTI YSHLLLVAAG WEAQFLYLYA LIYFLQCINA CRIIMRCWLC WKCKSQNPLL 140 


a a a a a a 
YDANYFVCWH THNYDYCIPY NSYTDTIVVT £GDGISTPKL KEDYQIGGYS EDAHSGVKDY VV¥VHGYFTEV 210 
a a 
YYQLESTOIT TOTGIENATF FIFNKLVKDP PNV QIHTIDG SSGVANPAMD PIYDEPTTTT SV PL 24 


Figure 1. Sequence analysis of the 3a protein of SARS-CoV. The ORF 3 gene encodes a putative protein of 274 amino 
acid residues. Predicted transmembrane regions are demarcated by arrows and the direction is from the outer surface 
to the interior of virion envelope. Cysteine residues are indicated with triangles and the cysteine-rich region is under- 
lined. Amino acid residues in italics have mutations among the isolates of SARS-CoV, especially G marked with dots 


has two different mutations. 


levels of antibodies against 3a1 and 3a2 (Figure 3A 
and B). As a negative control, a peptide derived 
from mice IL-12 82 receptor was used and no pep- 
tide-specific antibodies were detected in all nega- 
tive controls and SARS patients (no shown). The 
results suggest that when the 3a protein was 
exposed to patients during SARS-CoV infection, 
3a protein-specific antibodies were produced. 

Since S protein is a major structural protein of 
SARS-CoV, the peptide-specific antibodies against 
S protein in the patients’ sera should also be 
detected by the same approach. The same sera- 
samples of SARS patients that were analyzed with 
3a2 peptide, were measured again with S1 peptide 
derived from the extracellular region of S protein. 
As expected, the pattern of ELISA assay with S1 
peptide was similar to that of 3a1 and 3a2 peptides 
(Figure 3C). In addition, statistical analysis of the 
correlation between the responder of S1 and 3a2 
with SigmaPlot (SPSS Inc. USA) indicated that the 
response of the 3a2 was associated with that of S1 
(r = 0.60, P < 0.01; Figure 3D). Our data strongly 
suggest that the 3a protein and S protein are 
equally exposed to the human immune system, 
and then the induced antibodies are expressed at 
comparable levels to each other in the same SARS 
patient. 

Until now, there have been 60 complete or partial 
genomic sequences of SARS-CoV isolates from 
human and from four available Himalayan palm 
civets. According to these sequencing data, we 
analyzed the 3a protein mutations of these isolates. 
The results showed that the 3a protein sequences 
of 48 SARS-CoV isolates were identical, which is 
referred to as the wild-type, whereas the rest of 
the isolates showed different mutations on their 3a 
protein sequences, of which 15 isolates showed 
point-mutations, including six synonymous 
mutations, and one isolate showed a frame-shift 
mutation. Interestingly, the analysis of S protein 
sequences showed that the sequences of 44 isolates 
were identical, of which 43 isolates also belong to 
the “wild-type” of the 3a protein, whereas another 
19 isolates had mutations on their spike sequences. 
Statistical analysis with SigmaPlot indicated that 
the mutations of the 3a protein were associated 
with that of S protein (r = 0.79, P < 0.05). We did 
not find similar correlation with other structural 
proteins of SARS-CoV isolates (not shown). 


Further analysis indicated that the mutation pat- 
terns in the S protein sequences from Himalayan 
palm civets were identical only in three human 
SARS-CoV isolates (GD01, AY278489.2; GZ43, 
AY304490.1; GZ60, AY304491.1), whereas the same 
mutant site (19 t— a) of 3a protein sequences 
from Himalayan palm civets was also only found 
in those three human SARS-CoV isolates. This 
kind of mutation-correlation between S protein 
and 3a protein suggests that there is some relation- 
ship between the SARS-CoV isolates from Himala- 
yan palm civet and those from human being, at 
least, GD01, GZ43 and GZ60. Taken together, these 
results suggest that the biological function of the 
3a protein is tightly correlated with S protein of 
SARS-CoV. 


Phylogenetic analyses of the sequence of SNE 
locus of coronaviruses 


It has been shown that ORF 3a is located within 
the region between the S and E genes in the 
SARS-CoV genome.°’” By analyzing the genome 
organizations of other known coronaviruses, we 
found that the pattern of a potential gene locus 
between the S and E genes is conserved in all 
three groups of coronavirus genomes (Figure 4A). 
Therefore, we designate this locus as SNE (5S 
neighbor E). In general, the SNE locus consists of 
one to three ORFs in different species of corona- 
viruses, which encodes putative proteins contain- 
ing at least one predicted transmembrane region 
(Figure 4A). 

Figure 4A shows that only the genomes of SARS- 
CoV and PEDV have one complete ORF in the SNE 
locus, whereas the putative genes corresponding to 
ORF 3 in SNEs of five other coronaviruses are trun- 
cated and short ORFs are produced due to a 
mutation. However, if the entire sequence of the 
SNE locus was used for phylogenetic analysis, the 
topology of the resulting phylograms of SNE 
would be remarkably similar to that of S proteins 
(Figure 4B).°” In other words, the phylogenetic 
analysis of SNE sequences indicates that SARS- 
CoV is not closely related to any of the three 
known groups of coronaviruses, which is in agree- 
ment with the phylogenetic analyses of all other 
known structural proteins of SARS-CoV.°” 
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Figure 2. Formation of interchain disulfide bonds between the 3a protein and spike protein. A, Identification of the 
3a protein in the cytosol of SARS-CoV-infected Vero E6 cells. Vero E6 cells (lane E6) and the cytosol of infected Vero 
E6 cells (lane E6 + V) were subjected to Western blotting with anti-3al antibody (against the 3a protein). An arrow 
indicates the band of the 3a protein. B, The 3a protein forms interchain disulfide linkages with S protein. The crude 
SARS-CoV virions were lysed in a sample loading buffer either with 100 mM DTT (reducing sample, lane R) or with- 
out DTT (non-reducing sample, lane NR), and then subjected to Western blotting with anti-S2 antibody (against S pro- 


tein) (left panel) or anti-3al antibody (right panel). 


Discussion 


The 3a protein might be a minor 
structural protein 


Though proteomics identified the 3a protein, an 
ORF3a gene product of SARS-CoV, it is a minor 
structural protein on the surface of SARS-CoV 
viral envelope. This is apparent because: (1) Wes- 
tern blotting with anti-3al antibody detected the 


OD Value 


Normal Control 


OD Value 


Normal Control 


High Response Low Response 


3a protein in the crude virions (Figure 2). (2) Since 
the conserved motif of the cysteine-rich region of 
the S protein is located between the end of the 
membrane-spanning region and the cytoplasmic 
tail, this motif may provide a frame for formation 
of interchain disulfide bonds (see Figure 3 of Rota 
et al.°), the spatial orientation of the 3a protein 
might determine the usage of its cysteine-rich 
region with the frame of the S protein in the virus 
envelope interior. (3) It is well known that the 
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Figure 3. Detection of ORF3a peptide-specific IgG antibodies in SARS patients. The IgG antibodies in the sera of 
SARS patients were measured by ELISA with: A, 3a1 peptide; B, 3a2 peptide; C, S1 peptide. The low-response group 
consists of individual patients who had low A values, which were below the average of the normal control, whereas 
the high-response group had A values that were above the average of the normal control. D, The correlation of anti- 
bodies induced by the 3a protein and S protein in the same patient in response to 3a2 and S1-peptides with 


SigmaPlot software. 
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reducing environment of the cytosol in cells 
decreases the likelihood of disulfide bond formation. 
Therefore, the formation of the interchain disulfide 
bonds between the 3a protein and S protein must 
be in the interior of the viral envelope, in which 
there is a non-reducing environment. However, 
it might be necessary to further confirm this 
assumption with the more purified SARS-CoV 
virions, since the crude virions may be contami- 
nated with cellular proteins and non-structural 
viral proteins. 


Cysteine-rich regions may play an important 
role in SARS-CoV 


Spike glycoprotein of coronaviruses contains 
many cysteine residues. In addition, there is a 
cysteine-rich region, overlapping the junction of the 
membrane-spanning region and the cytoplasmic 
region of S proteins, and this conserved structure of 
S protein exists in all three groups of coronaviruses.° 
The molecular function of the cysteine-rich domain 
in S protein is largely unknown. Recently, some 
evidence showed that the cysteine-rich domain of S 
protein was required for coronavirus-induced mem- 
brane fusion.’”*° The sequence analysis showed that 
the 3a protein of SARS-CoV has a similar cysteine- 
rich region that is also located at the junction of the 
putative transmembrane and cytoplasmic region 
(Figure 1). (4) Here for the first time we show that 
the 3a protein might form interchain disulfide 
bonds with S proteins, probably through the 
cysteine-rich domains (Figure 2). Since no such kind 
of cysteine-rich domain exists in any other known 
SNE loci of coronaviruses, interchain disulfide 
bonds might not form between the proteins encoded 
by SNE loci and S protein in other coronaviruses. 


Therefore, we postulate that the interaction between 
the 3a and S proteins occurs through their cysteine- 
rich regions, which is a special case in the corona- 
virus family, which may play an important role in 
affecting SARS-CoV virulence. 


The 3a protein probably evolves from the 
SNE locus 


An obvious question to ask is: what is the origin 
of the 3a protein of SARS-CoV during the evo- 
lutionary process of the coronaviridae family. The 
analysis on the SNE locus may provide some 
clues to answer this question, since the ORF gene 
encoding the 3a protein is located in this locus 
(Figure 4). First of all, the SNE locus exists widely 
in all three groups of coronavirus family besides 
SARS-CoV, which consists of one to three ORFs 
(Figure 4). It is likely that the SNE locus in the 
primitive coronavirus has only one ORF, and 
many short ORFs within the SNE locus may be 
produced in different strains of coronaviruses due 
to high mutation rates in this locus during the evol- 
utionary process. Based on complete genome com- 
parison, phylogenetic analysis of SARS-CoV 
showed that SARS-CoV is closer to group 1 of 
coronaviruses," which is in agreement with our 
phylogenetic tree of the SNE locus (Figure 4B). 
Since it was proposed that PEDV and H229E 
diverged later than TEGV from their common 
ancestor in the group 1 according to the analysis 
of SNE locus,'° we speculate that the 3a protein of 
SARS-CoV evolved from the SNE locus of PEDV 
or a PEDV-like ancestor. 

It is not clear whether the SNE is necessary or 
dispensable for virulence of coronaviruses. It has 
been hypothesized that ORF 3/3a and ORF3-1/3b 
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Figure 4. Analysis of SNE locus in the coronavirus family. A, Comparison of genome organization between S-gene 
and E-gene of coronaviruses. The predicted ORFs between the gene encoding the spike (S) protein and the gene encod- 
ing the small envelope (E) protein of seven species from three groups of coronaviruses were compared. The transmem- 
brane (TM) region was also analyzed. B, Phylogenetic analysis of SNE locus. Unrooted phylogenetic tree of SNE region 
was generated by using CLUSTAL W 1.74 with the BLOSUM comparison matrix, and was drawn with the Phylip 
Drawtree program 3.6a3. Branch lengths indicate the number of substitutions per residue. Numbers indicate bootstrap 


replicates supporting each node. 
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may be related to viral virulence and 
pathogenesis,""~"'* whereas some suggested that 
ORF3a of TGEV was not essential for enteric 
virulence.* According to the data collected by 
ELISA (Figure 3), we postulate that the 3a protein 
plays an important role in the virulence of SARS- 
CoV, although it might not be the same case in 
other coronaviruses. Therefore, the 3a protein may 
serve as a new clinical marker or drug target in 
the treatment of SARS. 


Materials and Methods 


Cell culture and virus infection 


African green monkey kidney cells (Vero E6, ATCC) 
were maintained in Dulbecco’s modified Eagle’s med- 
ium (DMEM, Gibco-BRL) supplemented with 10% (v/v) 
fetal bovine serum (FBS, Gibco-BRL) at 37°C in a 5% 
(v/v) CO, atmosphere. 

For virus infection, Vero E6 cells were treated for one 
hour with the DMEM (2% FBS) containing SARS-CoV 
virions (BJ-01 isolate, provided by Academy of Military 
Medical Sciences). The virus-medium was removed 
after the infection and the infected cells were cultured 
in DMEM with 2% FBS at 37°C in 5% CO. After 48 
hours post-infection, the supernatant medium of the 
infected cells was collected and centrifuged at 
12,000 rpm for 30 minutes (Eppendorf 5415D) to elimin- 
ate the cell debris as crude SARS-CoV virions, of which 
TCIDso (tissue culture infectious dose) was identified as 
10° dilution. All the experiments using the virus were 
carried out in a Bio-safety Level 3 laboratory. 


Preparation of cytosol of infected cells 


Vero E6 cells were infected with SARS virus for 
24 hours. The infected cells were lysed with a solution 
containing 40 mM Tris (pH 8.3) and 0.5% (v/v) Nonidet 
P-40 at room temperature for five minutes. The lysate 
was centrifuged at 8000 rpm for five minutes (Eppendorf 
5415D), and then the supernatant was collected and 
heated at 100 °C for five minutes. 


Peptide synthesis and raising polyclonal antibodies 


The peptide sequences were determined based on the 
prediction of epitopes of the spike protein (gi:29836496) 
and the 3a protein (gi:30275670) as described by Ram- 
mensee et al.” combined with analysis of antigenicity 
(DNASTAR,+). The S1 peptide (amino acid residues 
332-351, TKFPSVYAWERKKISNCVAD) and the S2 pep- 
tide (amino acid residues 758-780, RNTREVFAQVKQ- 
MYKT PTLKYFG) correspond to the spike protein; the 
3al peptide (amino acid residues 176-199, STPKLKE- 
DYQIGGYSED RHSGVKD) and the 3a2 peptide (amino 
acid residues 7-26, FFTLGSITAQPVKIDNASPA) corre- 
spond to the 3a protein. These peptides were synthesized 
using conventional solid-phase chemistry and purified 
by GL Biochem (Shanghai) Ltd. A peptide from the 
sequence of mice IL-12 2 receptor (amino acid residues 
CNRLDLGINLSPD LAESPRFI) was synthesized as a 
negative control. In addition, the nucleocapsid protein 


+ http:/ /www.dnastar.com/ 


was expressed in Escherichia coli and purified as an anti- 
gen for generating a polyclonal antibody.” 

Rabbits were immunized with 200 wg of peptide-BSA 
or protein in 0.4 ml of emulsion mixed 1/1 (v/v) with 
complete Freund’s adjuvant (CFA) supplemented with 
Mycobacterium tuberculosis to a final concentration of 
1mg/ml. The rabbits were boosted twice with freshly 
prepared emulsion of the conjugated peptide or protein 
and Freund’s incomplete adjuvant at three week inter- 
vals. Blood was drawn from the rabbits at seven weeks 
following immunization, the blood was allowed to clot 
at 4°C, and the antiserum was recovered by centrifu- 
gation. The antiserum was then purified with a BSA affi- 
nity column (CNBr-Sepharose 4B) according to the 
manufacture’s instruction (Amersham Pharmacia). 


ELISA analysis 


The sera of 56 SARS patients, who were diagnosed by 
clinical symptoms and laboratory examination, were col- 
lected from different hospitals in Beijing. As normal con- 
trol, the sera of 36 healthy people were collected. The 
sera dilution was 1: 10 (v/v) for ELISA assay. 

ELISA assay was done as described.™ In brief, 96-well 
microtiter plates were coated with the peptides (S1, 3al 
and 3a2, respectively) in 0.1 M carbonate buffer (pH 9.6) 
at 4°C overnight. After blocking with PBS containing 
3% (w/v) gelatin and 0.1% (v/v) Tween 20, the plates 
were incubated with diluted sera from individual SARS 
patients or healthy controls at 37°C for two hours. 
Bound antibodies were detected with horseradish per- 
oxidase-coupled goat anti-human IgG (Bio-rad) and the 
absorbance values were measure by microplate autorea- 
der (Bio-tek) at 450 nm. 


Western blotting 


The cellular fractions or viruses were lysed either in 
the non-reducing loading buffer (50 mM Tris (pH 6.8), 
2% (w/v) SDS, 10% (v/v) glycerol, 0.1% (w/v) bromo- 
phenol blue) or reducing loading buffer containing 
100mM DTT. The mixtures were subjected to SDS- 
PAGE, and then transferred to Immobilon-P membrane 
(Millipore). Immunoblotting was carried out with the 
anti-3al antibody (against the 3a protein) or anti-S2 anti- 
body (against S protein). The proteins were detected by 
enhanced chemiluminescence (ECL, Amersham Pharma- 
cia Biotech). 


“Shot-gun” approach 


The cytosol fraction of infected Vero E6 cells was 
digested with trypsin and analyzed with 2D-LC-MS/MS 
system (ProteomeX, Thermo Finnigan) as described.’ 
Protein identification was performed with BioWorks 
version 3.1(Thermo Finnigan) and the SEQUEST algor- 
ithm. Since the Vero-E6 cells were from monkey, both 
the human database and the SARS database from NCBI 
were merged. The MS results were searched against 
either the combined database or the SARS database 
alone. Protein results were further filtered with Xcorr 
14+ > =15,2+ > =20,3+ > =2.5). 


Mass spectrometry for identification of 
SDS-PAGE samples 


The cytosol fractions of infected Vero E6 cells were 
subjected to PAGE and stained with Coomassie brilliant 
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blue. Gel slices were excised and in-gel digestion was 
carried out with trypsin as described.” The digested 
peptides were analyzed by mass spectrometry (LCQ 
Deca XP Plus, Thermo Finnigan). Data analysis was carried 
out as described for the shot-gun approach. 
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